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ABSTRACT 

We compute the bispectrum of the 2dF Galaxy Redshift Survey (2dFGRS) and use it to mea¬ 
sure the bias parameter of the galaxies. This parameter quantifies the strength of clustering of 
the galaxies relative to the mass in the Universe. By analysing 80 million triangle configura¬ 
tions in the wavenumber range 0.1 < k < 0.5 /iMpc ” 1 (i.e. on scales roughly between 5 and 
30/i _1 Mpc) we find that the linear bias parameter is consistent with unity: &i = 1.04±0.11, 
and the quadratic (nonlinear) bias is consistent with zero: 62 = —0.054 ± 0.08. Thus, at least 
on large scales, optically-selected galaxies do indeed trace the underlying mass distribution. 
The bias parameter can be combined with the 2dFGRS measurement of the redshift distortion 
parameter (3 to yield = 0.27 ± 0.06 for the matter density of the Universe, 

a result which is determined entirely from this survey, independently of other datasets. Our 
measurement of the matter density of the Universe should be interpreted as il rn at the effective 
redshift of the survey (z = 0.17). 

Key words: galaxies: clustering- cosmology: large-scale structure of Universe, cosmological 
parameters 


1 INTRODUCTION ated in the early Universe. Comparison with theoretical predictions 

offers the chance to test models of the generation of the pertur- 
Clustering of mass in the Universe is believed to be a result of am- bations, as well as putting important constraints on cosmological 
plification by gravitational instability of small perturbations gener- 
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parameters, which control the growth rate of the perturbations. A 
fundamental limitation on such a comparison has been that theoret¬ 
ical models predict the clustering properties of the mass in the Uni¬ 
verse, and yet we have few direct measures of mass observationally. 
More readily observable is the distribution of luminous objects such 
as galaxies, so to compare with theory one has to determine, or as¬ 
sume, the relationship between the clustering of mass and the clus¬ 
tering of galaxies. In general one will expect these to differ, because 
the efficiency of galaxy formation may depend in some nontrivial 
way on the underlying mass distribution. The idea that structures 
may be ‘biased’ tracers of the mass distribution goes back to Kaiser 
(1984), who explained the high clustering strength of Abell clusters 
as due to their forming in high-density regions of the Universe. In 
addition, observations indicating that different types of galaxy clus¬ 
ter differently (e.g. Dressier 1980; Postman & Geller 1984; White, 
Tully & Davis 1988; Hamilton 1988; Lahav, Nemiroff & Piran 
1990; Lahav & Saslaw 1992) show that they cannot all be unbi¬ 
ased tracers of the mass. Bias became an attractive way to recon¬ 
cile the low velocities of galaxies with the high-density Einstein-de 
Sitter model favoured in the 1980s (e.g. Davis et al. 1985), but after 
the Cosmic Background Explorer (COBE) determined the ampli¬ 
tude of primordial fluctuations on large scales (Smoot et al. 1992), 
the ‘standard’ biased Cold Dark Matter (CDM) model became less 
popular. With the advent of more detailed datasets in the cosmic 
microwave background (CMB) and large-scale structure, it is pos¬ 
sible to investigate and constrain a wider range of galaxy formation 
models, and an unknown bias relation adds uncertainty to the pro¬ 
cess. 

Since the efficiency of galaxy formation is not well understood 
theoretically, it makes sense to try to measure it empirically from 
observations. When the perturbations are small (or on large, lin¬ 
ear scales), it is difficult to do this: there is a degeneracy between 
the unknown amplitude of the matter power spectrum P(k ) and 
the degree of bias, b, defined such that the galaxy power spectrum 
is P g (k) = b 2 P(k). In principle, b may be a function of scale, 
through the wavenumber k. At later times (or on smaller scales), 
however, the degeneracy is lifted by nonlinear effects. One fea¬ 
ture of nonlinear gravitational evolution is that the overdensity field 
5(x) = (p(x) —p)/p becomes progressively more skewed towards 
high density. In principle skewness could also arise from non- 
Gaussian initial conditions; in practice this can be neglected (Verde 
et al. 2000), since CMB fluctuations are consistent with Gaussian 
initial conditions (Komatsu et al. 2001; Santos et al. 2001). One 
can thus hope to exploit the gravitational skewness, but skewness 
could equally well arise from biasing, e.g. from a galaxy formation 
efficiency that increased at dense points in the mass field. It is nev¬ 
ertheless possible to distinguish these two effects by considering 
the shapes of isodensity regions. If the field is unbiased, then the 
shapes of isodensity contours become flattened, as gravitational in¬ 
stability accelerates collapse along the short axis of structures, lead¬ 
ing to sheet-like and filamentary structures (e.g. Zeldovich 1970). 
If the galaxy field is highly biased with the same power spectrum, 
however, the underlying mass field is of low amplitude, and thus ex¬ 
pected to be close to the initial field, which is assumed to be Gaus¬ 
sian. These fields do not have highly-flattened isodensity contours, 
as bias does not flatten the contours; for example Eulerian bias pre¬ 
serves the contour shape. Thus there is a difference which could 
be detected, for example, by studying the three-point correlation 
function. In this paper, we exploit this effect in Fourier space rather 
than real space, by analysing the bispectrum: (5ki5k 2 5k3), where 
5k is the Fourier transform of the galaxy overdensity field. The the¬ 
ory for the bispectrum is set out in Fry (1994), Hivon et al. (1995), 


Matarrese, Verde & Heavens (1997), Verde et al. (1998), Scocci- 
rnarro et al. (1998), Scoccimarro, Couchman & Frieman (1999) and 
Scoccimarro (2000). 

The galaxy survey we use is the Anglo-Australian Tele¬ 
scope 2 degree field Galaxy Redshift Survey (Colless et al. 
2001), as compiled in February 2001. It was created with the 
2dF multi-fibre spectrograph on the Anglo-Australian Telescope 
(Lewis et al. 2001), and currently consists of over 200,000 
galaxies with redshifts up to about z = 0.3, broadly in two 
regions centred near the south and north gala ctic poles. See 
http : //www, mso . anu . edu . au/2dFGRS/ for further de¬ 


tails. It is the first survey which is large enough to put tight con¬ 
straints on the bias parameter, as previous surveys are too shal¬ 
low or too sparse. In this paper, we use 127,000 galaxies from the 
February 2001 compilation of the catalogue, truncated at 0.03 < 
a < 0.25. 

The outline of the paper is as follows: in Section 2 we review 
the theory of growth of the bispectrum through gravitational insta¬ 
bility, and discuss briefly the effects of redshift-space distortions; 
in Section 3 we illustrate our method of measuring the bias pa¬ 
rameter. This method uses a new estimator of the bias parameter, 
which allows us to analyse many millions of k-vector triplets, thus 
drastically improving the signal to noise. In Section 4 we test the 
performance of the method. Finally in Section 5 we present our re¬ 
sults and in Section 6 we discuss the implications of these results. 
An Appendix presents and describes in detail our new estimator of 
the bias parameter. 


2 THE BISPECTRUM IN REAL AND REDSHIFT SPACE 

The statistic we use to measure the bias of the galaxies is the bis¬ 
pectrum B, which is related to the three-point correlation function 
in Fourier space. For the mass, this is defined by 

(^k^ka^ka) = (27r) 3 B(ki,k2,k 3 )5 D (ki + k 2 + k 3 ) (1) 

where 5k = / d 3 x5(x) exp(— ik ■ x) is the Fourier transform of 
the mass overdensity 5(x) = p(x)/p — 1 and 5° is the Dirac delta 
function, which shows that the bispectrum can be non-zero only if 
the k- vectors close to form a triangle. 

The power spectrum P is similarly defined by 

(5 k 5 k /> = (27r) 3 P(fc)5 D (k + k'). (2) 

Analogous relations hold for the power spectrum and bispectrum 
of the galaxy distribution. We assume that the mass overdensity is 
a Gaussian random field initially, as closely predicted by inflation¬ 
ary early universe models. Thus, at asymptotically early times the 
bispectrum is zero by symmetry. As gravitational instability devel¬ 
ops, the field becomes asymmetric, because nonlinear effects skew 
the density field to high densities. In this way, a non-zero bispec¬ 
trum develops. In the mildly nonlinear regime, we use second-order 
perturbation theory to compute the expected bispectrum. To second 
order (in the overdensity 5) the Fourier coefficients develop a non¬ 
linear component which is proportional to 5 2 , so the leading order 
term in the bispectrum grows like 5 4 . Since the 2dFGRS is not a 
survey of mass density, to interpret the bispectrum measured from 
the survey we must make some assumption about the distribution 
of mass relative to the distribution of galaxies. To date, this uncer¬ 
tainty in the relationship between the mass and the galaxy distribu¬ 
tion has placed a limitation on the usefulness of galaxy catalogues 
as a probe of cosmology. We make the assumption that the galaxy 
overdensity field 5 g is related to the underlying mass overdensity by 
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some deterministic function, which we expand in a Taylor series as 
(cf. Fry & Gaztanaga 1993) 


i=0 


b_df_ 

i\ 


(3) 


proposed. We use the form 

f 3 s (ki, k2, ks) = (B12 + B23 + P31) x ( 6 ) 

^ _l_ dykiUi a p ^ ^ ayk^nlo-p ^ ^ ^ 


We must keep terms up to i = 2, since these enter in the bispec¬ 
trum at the same level as second-order perturbation theory growth 
terms, and we ignore higher-order terms. In order for S g to have 
zero mean, there is a (calculable) bo term, but we ignore it as it con¬ 
tributes only to k = 0 ; 61 is the linear bias parameter, and 62 is the 
quadratic bias parameter. A non-zero 62 would indicate nonlinear 
biasing of galaxies with respect to mass. Both of these parameters 
are estimated in this paper from the 2dFGRS. 

In real space, the two effects of nonlinear growth and nonlin¬ 
ear bias contribute terms to a non-zero bispectrum: 


B(ki, k 2 , ks) 


P 9 (fci)P 9 (fc 2 ) 


J(ki,k 2 ) , b 2 
bi b 2 


+ cyc. 


(4) 


where there are two additional cyclic terms (2,3) and (3,1). De¬ 
tails of the theory leading to : ji may be found in e.g. Matarrese, 
Verde & Heavens (1997). We assume here that the galaxy power 
spectrum is P 9 (fc) = b\P(k) (see Heavens, Matarrese & Verde 
1998 for discussion of this point); J is a function that depends on 
the shape of the triangle in k— space, but only very weakly on cos¬ 
mology (e.g. Bouchet et al. 1992; Bouchet et al. 1995; Catelan et al. 
1995). Note that we assume a deterministic bias; other authors (e.g. 
Scherrer & Weinberg 1998; Dekel & Lahav 1999; Taruya, Koyama 
& Soda 1999) have investigated stochastic bias, where there is a 
random component to the relationship between S and 8 g . In the 
case of stochastic bias the bispectrum has still the form of equation 
(3) but the function J is modified into J' in such a way that when 
the correlation coefficient of the stochastic bias r goes to unity (i.e. 
bias becomes deterministic), J’ —> J. Theoretical considerations 
suggest that r ~ 1 on scales of interest (e.g. Tegmark & Peebles 
1998; Blanton et al. 2000; Seljak 2000). 

In redshift space, both the power spectrum and the bispectrum 
are modified by redshift-space distortions, arising because the dis¬ 
tance estimator (the redshift) is perturbed by peculiar velocities. 
These distortions are radial in nature, and can be analysed, at some 
expense in complexity, in radial and angular basis functions (e.g. 
Fisher, Scharf & Lahav 1994; Heavens & Taylor 1995; Ballinger, 
Heavens & Taylor 1995; Hamilton 1998; Tadros et al. 1999). Here, 
we adopt the distant-observer approximation (Kaiser 1987), and as¬ 
sume that nonlinear effects can be modelled by an incoherent small- 
scale velocity field, characterised by the pairwise velocity disper¬ 
sion Up. The large-scale in-fall leads to distortions which depend 
on the redshift distortion parameter /3 = /b\, where Q m is 

the matter density parameter. Assuming in addition an exponential 
distribution for the pairwise velocity the combined effect gives the 
power spectrum in redshift space (denoted by subscript s) 


Ps( k) = 


P(fc)(l + /^ 2 ) 2 

1 + k 2 fj, 2 ap /2 


(5) 


(e.g. Ballinger, Peacock & Heavens 1996; Hatton & Cole 1998), 
where fx is the cosine of the wavevector to the line of sight, which is 
a fixed direction in the distant observer approximation. Other mod¬ 
ifications have been suggested, such as a Gaussian, or exponential 
one-particle velocity dispersion, which yield different functional 
forms for the redshift distortion. Note that er p is usually written, 
as here, implicitly divided by the Hubble constant. The bispectrum 
is modified similarly, and again various modifications have been 


where 
B 12 = 


(1 + Pui)( 1 + dth.) x 


Ker(ki, k 2 ) b 2 
bi b 2 


P ff (k 1 )P 9 (k 2 ) 


(7) 


and the kernel function Ker is J modified for redshift space (see 
Verde et al. 1998 equation (13) for the formula); ay is an adjustable 
parameter which is shape-dependent, and must be calibrated from 
simulations. Scoccimarro, Couchman & Frieman (1999) propose 
an alternative]^] for the denominator of ( 6 ), namely 

(l + a 2 s [kfni + k 2 nl + fc 2 p 2 ]cr 2 / 2) 2 ( 8 ) 


where the one-particle dispersion cr„ = u p /\/2 if the small-scale 
velocities are incoherent, and again the parameter as needs to be 
calibrated for different triangle shapes and cosmologies. We find 
that the formula ( 6 ) recovers the true bias parameter in an ensem¬ 
ble of simulated biased 2dFGRS catalogues with smaller scatter 
than (| 8 [). We see from ( 6 ) and (7) how the bispectrum can allow 
us to measure the bias parameters. The left-hand side is potentially 
observable, and we can hope to constrain 61 and b 2 by considering 
triangles of different shape (and hence different Ker). 


3 METHOD 


The previous Section shows the theoretical model for the bispec¬ 
trum, and its dependence on the two parameters bi and b 2 \ note 
that, apart from these two parameters, the bispectrum depends on 
observable quantities such as /3, a p , and P g . The real-space galaxy 
power spectrum is obtained from the angle-averaged redshift-space 
power (cf. Verde et al. 1998): 


P(k) 


Ps{k) 


' (a 2 pk 2 -f3)0 

&pk A 


+ 


2/3 2 , V2{k 2 ol-2P)\ _ 1 (ka p \ 

2>u 2 k 2 + tan \f2j 


-1 


(9) 


The real-space power spectrum obtained this way agrees well with 
the APM power estimated by Baugh & Efstathiou (1994) and Efs- 
tathiou & Moody (2001). The input catalogue is based on a revised 
and extended version of the APM galaxy catalogue (Maddox et al. 
1990a). 

The bispectrum and power spectrum data come from trans¬ 
forming the galaxy distribution as follows. The galaxies are 
weighted with the optimum weight for measuring the power spec¬ 
trum (Feldman, Kaiser & Peacock 1994), which also minimises 
the variance of higher-order correlation functions (Scoccimarro 
2000). The weight is ut(r) = 1/[1 + Pon(r)], where n(r) is 
the average number density of galaxies at position r, and Po is 
the power spectrum to be estimated. For speed reasons, Po was 
fixed at 5000 h~ 3 Mpc 3 so that a Fast Fourier transform could be 


* Note that in equation 38 of Scoccimarro, Couchman & Frieman (1999) 
there is an extra power of 2 outside the square brackets, which we omit in 
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Figure 1. Measured (redshift-space) 2dFGRS dimensionless bispectrum from the SGP and NGP for the two chosen configurations. The dotted line shows the 
perturbation theory prediction for &i = 1, & 2 = 0 while the dashed and dot-dashed lines show the shot noise contributions. The error bars are obtained via 
Monte Carlo simulation of 16 mock 2dFGRS catalogues (see text for details). 


employed. This is optimal for mimimising the variance at k ~ 
0.1 h Mpc -1 , whereas our signal comes from wavenumbers with 
a smaller power, but in fact altering Pq hardly changes the results. 


3.1 Estimating the bispectrum 

In this Section we describe how we estimate the bispectrum from 
the data, taking into account the survey shape, selection function 
and shot noise. 

We follow Feldman, Kaiser & Peacock (1994) and Matarrese, 
Verde & Heavens (1997), and transform the field 

T(r) s Aw(r) [n(r) - an r (r)] (10) 

where A is a constant to be determined, n r (r) is the number density 
of a random catalogue with the same selection function as the real 
catalogue, but with 1/a times as many particles. If we set A = 
J 22 1 ^ 2 , where 

Iij = J (i 3 rw'(r) rr'(r) (11) 

(Matarrese, Verde & Heavens 1997), then the power spectrum may 
be estimated from 

(|-Fk| 2 ) = -Fs(k) + -p-{l + a) (12) 

1 22 

and the bispectrum from 

<F kl F k2 F k:j ) = -^{S s ( kl , k2 ,k 3 )+ (13) 

I22 

j 1 [P 9 ( kl ) + P 9 ( k2 ) + P 9 ( ks )] + (1 - a 2 ) 1 -^} . 

J33 J33 J 

An underlying assumption is that the power spectrum is roughly 
constant over the width of the survey window function in k space. 
Because of the rather flat geometry of the survey regions, and the 
holes due to star drills, this criterion is not satisfied in detail. In 
practice, we have used multiple mock catalogues (Cole et al. 1998) 
with the same selection criteria as the survey regions to check 
that this assumption does not feed through into a biased estimate 
of the bias parameters. Also, the power spectrum and bispectrum 
estimates are convolved with the window function This can lead 


to changes in shape from convolution, and erroneous interpreta¬ 
tion of correlated noise as real features in the power spectrum. 
These effects are, however, not important in the wavenumber range 
(0.1 < k < 0.5 h Mpc -1 ) which we use for the bispectrum analysis 
(Percival et al. 2001). 

To compute the F k , we generated random catalogues with ap¬ 
proximately 5 times as many particles as the real catalogue, and 
analysed the North and South Galactic Pole regions (NGP & SGP) 
separately. We ignored the random fields present in the 2dFGRS 
as these complicate the window function and add very little infor¬ 
mation for the current analysis. Fast Fourier transforms were per¬ 
formed on a 512 x 512 x 256 grid which encompassed all the 
particles, leading to a grid spacing of about 1 ft -1 Mpc. 

3.2 Choice of triangles 

We use the real parts of F kl F ka F ka as our data, for triangles in k 
space (i.e. where lti + k 2 + k 3 = 0). Each triangle allows us to es¬ 
timate a linear combination of the parameters natural to this analy¬ 
sis: ci = I /61 and c 2 = & 2 /& 1 . through equations ( 6 ) and (7). Note 
that we must use triangles of different shape (and hence different 
Ker[ k i, k: 2 ]) to lift the degeneracy between nonlinear gravity and 
nonlinear bias. As explained in the introduction, this is equivalent 
to analysing the shapes of structures, which are different in the two 
cases. 

There is clearly a huge number of possible triangles to inves¬ 
tigate (many millions), and we are faced with a problem of how 
to choose the triangles to analyse. The problem is that triangles 
which have a wavevector in common will be correlated, through 
cross-terms in the 6 -point function. A likelihood analysis as origi¬ 
nally suggested in Matarrese, Verde & Heavens (1997) and Verde 
et al. (1998) with millions of correlated data is unfeasible, so we 
take a different approach. We use two sets of triangles of differ¬ 
ent configurations: one set with one wavevector twice the length of 
another, and another set with two wavevectors of common length. 
For the former, calibrations with mock catalogues give av = 1.8, 
and the latter av = 1-0. We set a lower limit to the wavenum¬ 
ber range of A: = 0.1 h Mpc -1 , to avoid the effects of convolution 
with the window function (Percival et al. 2001). We set an upper 
limit of k = 0.5/t Mpc -1 where, for the 2dFGRS power spec¬ 
trum, shot noise begins to dominate the signal so that little further is 
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Figure 2. Error on the linear bias parameter from ACDM mock galaxy catalogues in the SGP (left) and NGP (right). The average bias in the estimator is 
consistent with zero for the SGP: —0.01 ± 0.03, but shows a small bias of 0.10 it 0.04 for the NGP. The sample rms of 0.11 for the SGP and 0.16 for the 
NGP are used in the analysis of the 2dFGRS. 
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gained by increasing the limit. In addition, we impose a constraint 
k < 0.35 /iMpc -1 for the second configuration choice, where per¬ 
turbation theory for this configuration begins to break down. This 
leaves us with 80 million triangles. Two considerations motivate us 
to consider ’’only” these 80 million triangles:;) Adding more highly 
correlated triangles complicates the analysis significantly and does 
not add much signal and ii ) more importantly, only these two con¬ 
figurations have been extensively tested against fully non-linear N- 
body simulations. In fact not only perturbation theory may break¬ 
down on different scales depending on the triangle configuration, 
but also the (shape dependent) redshift space distortion parameter 
av (eq. 6 ) has been calibrated only for these two configurations. 

To cope with so many triangles, we define a new estimator for 
fei and &2 in the Appendix. Although the estimator is not minimum- 
variance, it is unbiased and has the big advantage that it allows us 
to analyze many triangles, thus increasing the signal to noise. The 
estimator does not give error bars; these are obtained by Monte 
Carlo simulation from 16 mock 2dFGRS catalogues (see Section 
4). In Fig. |I] we show the measured (redshift-space) 2dFGRS bis¬ 
pectrum from the SGP and NGP for the two chosen triangle config¬ 
urations. The dotted line shows the perturbation theory prediction 
for bi = 1 , 62 = 0 while the dashed line shows the shot noise con¬ 
tribution. For the bispectrum, the shot noise contribution becomes 
dominant around k = 0.5 IiMpc -1 . 


4 MOCK CATALOGUES AND TESTS 

We have used 16 mock catalogues from a Hubble Volume N-body 
simulation with S2 m = 0.3, I2 a = 0.7 (‘ACDM’ model), with 
the same selection function as the 2dF galaxy redshift survey (Cole 
et al. 1998) Q. This includes both the radial selection function and 
an angular mask that reflects the varying completeness of the sur¬ 
vey in February 2001. The catalogues contain mock galaxies whose 

t See _ 

tittp: //star-www. dur. ac. uk/~cole/mocks/hubble . html 

We find that 16 mock catalogs are sufficent to estimate the error bars. The 
choice of the cosmological model is not important for the error estimate, 
however this choice of the cosmological model turns out to be not too far 
away from the model recovered ”a posteriori”. 


positions are determined according to the prescription described in 
Cole et al. (1998). This is a 2-parameter exponential model based 
on the final density field, i.e. it does not conform to our assump¬ 
tion of equation (BI). We will show that nevertheless the bispectrum 
method recovers the bias parameter 61 , defined by the square root 
of the ratio of the galaxy and matter power spectra^. This is the cru¬ 
cial test, since it is this ratio that we wish to determine, as we can 
then translate the galaxy power spectrum to the underlying mass 
spectrum. For the simulated galaxies in the Hubble volume as a 
whole the square root of the power spectrum ratio varies between 
about 0.9 at large scales and 0.75 at k = 0.5/iMpc -1 . Over the 
scales probed by our bispectrum triangles, the bias is roughly 0 . 8 , 
but with an uncertainty which ultimately limits our error determi¬ 
nation. The power spectrum of the simulated 2dFGRS catalogues 
varies a little, so we estimate b\ = ( P g /P) 1 ^ 2 individually for each 
catalogue (and individually for SGP and NGP). from the wavenum¬ 
ber range 0.1 — 0. 5h Mpc “ 1 . This sets (3 for each sample, and we fit 
the pairwise velocity dispersion o v individually by requiring a good 
fit to the redshift-space power spectrum, using the real-to-redshift 
mapping of equation (9). 

We then analyse the set of triangles for each simulation, as de¬ 
tailed above, and in Fig. [ 2 ] we show the error in the bias from 16 
simulations in the SGP and NGP. The average bias in the estima¬ 
tor is consistent with zero for the SGP: —0.01 ± 0.03, but shows 
a small bias of 0.10 ± 0.04 for the NGP^j. Fig. ^ shows that the 
approximations made in the analysis (e.g. the functional form of 
the bias, the window function, the distant observer approximation 
etc.) do not significantly bias the result. We will use the sample 
rms of 0.11 for the SGP and 0.16 for the NGP as the errors in the 
analysis of the 2dFGRS. We also carried out the same analysis on 
16 mock catalogues of the SGP obtained from the Hubble volume 
simulation for a rCDM model. The true underlying bias parameter 
is correctly recovered with a 20% error (b = 1.7 ± 0.3). The er¬ 
ror in the natural parameter 1 /b\ is comparable for the ACDM and 
rCDM models, but the value of b\ itself is larger in the latter case. 


t For a more general discussion of definitions of bias, see Lahav et al. 
( 2001 ) 

§ This is understandable since the NGP is smaller than the SGP. thus the 
effects of the convolution with the window are more important. 
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Figure 3. l/b\ recovered with the bispectrum method versus the underly¬ 
ing (true) 1/bi = -\JP/P g for 16 mock SGP simulations for the rCDM 
and ACDM models. l/b\ is the natural quantity in the analysis of the bis¬ 
pectrum (see equation W). Note that the 2dFGRS has data in the NGP and 
SGP, reducing the error bar compared to these mock catalogues. 
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In Fig. |?] we show bi recovered with the bispectrum method versus 
the underlying (true) bi = yjP g /P for 16 mock SGP simulations 
for the rCDM and ACDM models. 


5 RESULTS 

We analyse the catalogue as compiled in February 2001 (i.e. the 
same catalogue as in Percival et al. 2001), and we analyse the NGP 
and SGP regions separately; we excise the random fields entirely. 
This leaves us with 75792 and 51862 galaxies in the SGP and NGP 
respectively. The two survey regions give us a useful consistency 
check. 

We initially present results fixing (3 = 0.43 and a p = 385 /Ho 
kms -1 (3.85 hr 1 Mpc). These are the best-fitting values from the 
analysis of the 2dFGRS redshift-space correlation function (Pea¬ 
cock et al. 2001; Norberg et al. 2001a; see also Tegmark, Hamilton 

6 Xu 2001). Note that this value of /3 involves a bias parameter that 
is defined somewhat differently from ours, in terms of the correla¬ 
tion function. This bias parameter will coincide with our bi only if 
certain assumptions hold, such as if the overdensity in galaxies is 
everywhere bi times the mass overdensity. In due course, (3 can be 
determined by analysis of the 2dFGRS power spectrum itself, but 
for the time being we assume that the two bias parameters are the 
same. Finally, we will marginalise over f3 and a p . 


5.1 Bias parameters 

It is difficult to find a method to display in a single figure all of 
the information from the 80 million triangles used: for this reason 
we display only one configuration and for the SGP alone; adding 
other configurations (or the NGP) would complicate the figure. Fig. 

shows the SGP data for configurations with two wavevectors of 
common length, along with the perturbation theory predictions for 
different combinations of the linear and quadratic bias parameters. 
The dashed and dot-dashed lines are the perturbation theory predic¬ 
tion for bi = 1.3, b 2 = 0 and bi = 1.0, 62 — 0.5. The error bars 
come from the mock catalogues, and the offset between the centres 
of the error bars and the points shows the bias in the estimator. 


Fixing the values of (3 and a p , and including all of the trian¬ 
gles in the SGP gives as a raw result b 1 = 1.06, b 2 = 0.01. The 
NGP linear bias is slightly higher: bi = 1.12, 62 = —0.07. Adjust¬ 
ing these results with the estimator bias obtained from the mock 
catalogues (Section 4), and associating an error from the mock cat¬ 
alogues gives 

bi = 1.07 ±0.11 (SGP) (14) 

bi = 1.02 ±0.16 (NGP) 

and a combined minimum-variance weighted result of 

br = 1.05 ±0.09. (15) 

Note that the estimator bias does not significantly alter the results: 
ignoring it increases the estimate by 0.03, much less than the sta¬ 
tistical error. 

Our estimate of the quadratic bias parameter from the NGP 
and SGP 2dFGRS is 

b 2 = -0.02 ± 0.07, (16) 

with SGP and NGP individually giving b 2 = 0.01 ± 0.09 and 
—0.07 ±0.11 respectively. 

Thus we see that the 2dFGRS galaxies are perfectly consis¬ 
tent with tracing the underlying mass distribution on these scales 
(wavelengths A = 2ir/k = 13 to 63 h -1 Mpc) i.e. bi is consistent 
with unity, and b 2 with zero. 

The results depend mildly on the values of j3 and cr p used. 
Marginalising over the distribution of these quantities estimated 
from the redshift-space correlation function (Peacock et al. 2001), 
the final results are virtually unchanged: bi = 1.04 ± 0.06, 
b 2 = —0.054 ± 0.04; adding these errors in quadrature with those 
of ( |l~5| ) gives our final estimates: 

bi = 1.04 ± 0.11 (17) 

b 2 = -0.054 ± 0.08. 


5.2 Luminosity-dependent and scale-dependent bias 

The 2dFGRS exhibits luminosity-dependent clustering (Norberg 
et al. 2001a; Norberg et al. 2001b), so it is reasonable to ask in 
what sense the galaxies in the 2dFGRS are unbiased. We have con¬ 
sistently used all the galaxies in the 2dFGRS NGP and SGP re¬ 
gions, to determine (3 and the bias parameters. With the weighting 
scheme employed to measure (3 and b 1 , the weighted mean lumi¬ 
nosity is 1.9L*. From studies of the correlation function, Norberg 
et al. ( 2001 a) found a luminosity dependence in the relative bias 
of 2dFGRS galaxies: b/b* = 0.85 ± 0.15L/L*. If we use this 
relation to adjust our recovered bias to apply to L* galaxies, we 
find they could be slightly anti-biased, but they are still consistent 
with bi = 1 (bi* = 0.92 ± 0.11 assuming b\ = b). The 2dFGRS 
galaxies, as a population, trace the mass extremely well. 

On scale-dependence of the bias, it is worth noting that theo¬ 
retical arguments (e.g. Mann, Peacock & Heavens 1998; Peacock 
& Smith 2000; Seljak 2000; Benson et al. 2000; Blanton et al. 
2000; Berlind & Weinberg 2001) suggest that bias is expected to be 
constant on large scales (larger than a few b _1 Mpc) and a scale- 
dependent bias is plausible on intermediate scales, but the scale 
dependence is expected to be weak. We find no evidence of scale- 
dependent bias (see Fig. 4). Both scale-dependence and luminosity- 
dependence may be explored in detail when the catalogue is com¬ 
plete. 
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Figure 4. Ratio of the average measured bispectrum and the average per¬ 
turbation theory predictions, relative to the bispectrum for a fiducial un¬ 
biased model (-Bfid)- Dashed line b\ = 1 . 3,62 — 0, dot-dashed line 
61 = 1.0,62 = 0.5. To produce this figures only the SGP data were 
used and only configurations with two wavevectors of common length. This 
means that only 12 million triangles were used from the total 80 millions. 
Inclusion of the remainder excludes both models at high confidence. The er¬ 
ror bars are obtained via Monte Carlo from the 16 N-body simulations, and 
are placed centrally on the mean of the estimates from the mock catalogues. 
This illustrates the level of bias in the estimator. The figure also shows that 
there is no evidence of scale dependent bias. 
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5.3 The matter density of the Universe 

Analysis of the redshift-space correlation function allows one to 
estimate j3 = Ulf /b\ (Peacock et al. 2001), and this can be com¬ 
bined with our determination of i>i to estimate the matter density 
parameter fl m . A strength of the result is that it is obtained from 
the 2dFGRS alone, independently of all other datasets. Following 
the same procedure as in the previous subsection, we marginalise 
over the uncertainty in j3 and a p , to get: 

Qm = 0.27 ±0.06. (18) 

The small size of the error bar is worth remarking on. For each pair 
of values of /3 and a p , we obtain an estimate of b i. The estimates 
of f3 and bi are slightly anti-correlated, which leads to a slightly 
smaller scatter in Q m than one would expect from the errors on /? 
and bi. 

This result is consistent with other recent determinations by 
other methods such as combining CMB with large-scale structure 
or supernova measurements (e.g. Percival et al. 2001; Jaffe et al. 
2001; Efstathiou et al. 2001) and from early weak lensing measure¬ 
ments (Floekstra, Yee & Gladders 2001). 


5.4 Comparison with previous work 

Previous work has concentrated on using cumulants, although there 
have been two analyses (Feldman et al. 2001; Scoccimarro et al. 
2001) of the bispectrum of infrared-selected galaxies observed with 
the Infrared Astronomy Satellite (IRAS; Fisher et al. 1995; Saun¬ 
ders et al. 2000). The IRAS samples are much smaller than the cur¬ 
rent 2dFGRS, and are also rather shallow, so shot noise and the ra¬ 
dial nature of the redshift distortion is more severe than for the 2dF- 
GRS. Nevertheless, bias estimates have been made, and reported 
with remarkably small error bars. The most accurate quoted values 
are from Feldman et al. (2001) 1/foi = 1.2^° ,® for the PSCz sur¬ 


vey (Saunders et al. 2000). This value of the linear bias parameter 
is consistent with our determinations, since IRAS galaxies have a 
power spectrum which is lower than optically selected galaxies, by 
a factor ~ 1.3 2 on the relevant scales (Peacock & Dodds 1994; 
Seaborne et al. 1999). 

Studies of the skewness, kurtosis and higher-order moments 
of the optical galaxy distribution (e.g. Gaztanaga 1994; Szapudi 
& Szalay 1997; Hui & Gaztanaga 1999; Hoyle, Szapudi & Baugh 
2000) have also shown consistency with a linear bias of unity, but 
here we are able to derive both the linear bias and the quadratic bias 
simultaneously. In addition, our present results are more accurate 
and convert these general indications of a low degree of bias into a 
strong constraint on theoretical models. 


6 SUMMARY AND CONCLUSIONS 

We have demonstrated through analysis of the bispectrum of the 
2dFGRS that the optically-selected galaxies of the sample trace 
the matter density extremely well on large scales (Fourier modes 
with 0.1 < k < 0.5 h Mpc -1 that approximately correspond to 
30 < r < 5/i -1 Mpc). Specifically, the linear bias parameter is 
very close to unity, and the quadratic (nonlinear) bias is very close 
to zero. Ironically, this is exactly the assumption which used to be 
made decades ago in large-scale structure analysis, but which was 
questioned when the concept of biased galaxy formation was intro¬ 
duced (e.g. Kaiser 1984; Peacock & Heavens 1985; Bardeen et al. 
1986; Dekel & Rees 1994). Theoretical arguments suggest that an 
initial bias at formation approaches unity with time, provided that 
galaxies are neither created nor destroyed (Fry 1996), and that the 
Universe does not become curvature- or vacuum-dominated in the 
meantime (Catelan, Matarrese & Porciani 1998). In any case, these 
assumptions will clearly fail at some level. Note that the effective 
depth of the survey is z = 0.17 and our measurement should be 
interpreted as the bias and S2 m at this epoch (for additional dis¬ 
cussion see Lahav et al. 2001). Because of a tendency for bias to 
approach unity with time, we do not by this measurement rule out a 
significant bias of galaxies at formation time, but the unbiased na¬ 
ture of the galaxies today puts a significant constraint on theoretical 
models of galaxy formation. 

Currently, we find no evidence of scale-dependent bias (see 
Fig.^). The size of the survey at the present time does not allow us 
to place strong constraints on scale dependence or luminosity de¬ 
pendence of the bias parameter, or on the nature of biasing (e.g. de¬ 
terministic, stochastic, Eulerian, Lagrangian etc.), but these issues 
will be explored with the completion of the survey. At some level 
nonlinear bias must appear on small scales, from the morphology- 
density relation (Dressier 1980; Hashimoto & Oemler 1999), but 
this is probably on too small scales for the perturbative method of 
this paper to be valid. 

Our measurement of the matter density of the Universe = 
0.27 ± 0.06 should be interpreted as f) m at the effective redshift 
of the survey. The extrapolation at z = 0 is model-dependent, 
but the changes this correction introduces are below the quoted 
error-bars for reasonnable choices of model. Our measurement is 
in agreement with other independent determinations such as cosmic 
microwave background together with Hubble constant constraints 
(e.g. Jaffe et al. 2001; Efstathiou et al. 2001; Freedman et al. 2001) 
which also give comparable error bars. 

It is worth re-emphasising that the analysis presented here re¬ 
lies only on the 2dFGRS dataset. It is also important that we not 
only conclude that the linear bias is essentially unity, but also the 
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quadratic (nonlinear) bias term is constrained to be very close to 
zero. Taken together, these measurements argue powerfully that the 
2dFGRS galaxies do indeed trace the mass on large scales. In ad¬ 
dition to our findings, joint analysis of the 2dFGRS and CMB data 
(Lahav et al. 2001) also supports our conclusion that the 2dFGRS 
galaxies are unbiased. Different methods of measuring linear bias 
require different assumptions, and it is remarkable that such differ¬ 
ent methods agree on the basic cosmological model. 
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APPENDIX 


Here we outline the new estimator for the bias parameters. Al¬ 
though not optimal, it allows many millions of triangles to be anal¬ 
ysed. 

The likelihood of ci = 1 /6i and C 2 = & 2/&1 from each tri¬ 
angle (labelled by 7) is degenerate, since it constrains only a linear 
combination of ci and C2, namely 


13 -y — R'y C\ ~\~ S-yC2 F 2 7 


(19) 


where the expressions for P 7 , S-y, and T 7 are at the end of this 
Appendix. The bispectrum estimate B 1 has an intrinsic statistical 
error cr 7 as discussed e.g. in Matarrese, Verde & Heavens (1997). 
In computing the bias parameter, we wish to weight the determina¬ 
tion obtained from each triangle by the inverse of the variance of 
the corresponding bispectrum. Since this is a cumbersome expres¬ 
sion (see Matarrese, Verde & Heavens 1997; Verde et al. 1998) we 
approximate this by its dominant Gaussian term, 

| 2 ) = ( Pg 1 + Ps n)( Pg 2 4 " Psn) (P 9 3 + Psn) (20) 


where P g i = P, (fci) etc. and Psn is the shot noise. Inclusion of 
higher-order terms leaves the results practically unchanged for this 
dataset, and adds considerably to the computing time. If we now 
assume that this error is Gaussian distributed we can immediately 
see that the likelihood contours will be lines in the ci — C2 plane: 


ln£(ci, C 2 ) oc — In < 7 . 


(P 7 — P 7 C1 -f- S^C 2 4 “ P 7 ) 2 
2<t 7 


( 21 ) 


This should give an unbiased a posteriori probability for ci and 
C2, if we assume uniform priors. We define a non-optimal esti¬ 
mator by simply multiplying these likelihoods together, ignoring 
the correlations between different triangles, forming what we term 
the pseudolikelihood. The maximum of the pseudolikelihood is still 
asymptotically unbiased. The error, however, cannot be determined 
internally; we estimate it by Monte Carlo methods i.e. from the 
dispersion of ci and C 2 estimates from 16 mock catalogues. This 
procedure also allows us to see if the estimator is biased or not. 
Moreover, by estimating the errors via the Monte Carlo method, our 
final estimates do not depend on the initial assumption of Gaussian 
likelihood. 

The assumption of Gaussian likelihood however, makes the 
maximum of the pseudolikelihood calculable analytically, yielding 
the following estimates for ci and C2 : 


2 A 1 A 22 — A 2 A 12 

A 2 12 ~ 4 A n A 


2 A 2 A 11 — A 1 A 12 

A \ 2 — 4A11A22 


(23) 


where 
Fin = 

A22 = 

A! = 
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(24) 

(25) 

(26) 

(27) 

(28) 


7 

We can also, if desired, compute ci on the assumption that C 2 = 0: 


c 1 = - 


Ai 

2An 


(C2 = 0 ). 


(29) 


The expressions for P 7 , S 7 , T 7 can easily be obtained from 
the expression for the bispectrum of Verde et al. (1998); Sections 
2.3 and 2.4. We report them here for completeness. To obtain their 
corresponding real space quantities just set a v = 0 and (3 = 0. 


P 7 = {[J(k 1 ,k 2 ) + M 2 /?A'(k 1 ,k 2 )] 2 P 9 (fc 1 )P 9 (fc 2 ) ( 30 ) 

x (1 +/?Mi)(l + Pul) +cyc.} P> 3 

where /r = —fj ,3 for the term explicitly written and the expression 
for the kernel K can be found e.g. in Catelan & Moscardini (1994), 


S-y = [(1 + Pn\){l + Pnl)Pg(k 1 )Pg(k 2 ) + eye ,]P >3 (31) 


I M 1 M 2/3 + + M 2 ) + /?M 1 M 2 (32) 


/P ( 2 ki , 2 fa \ 


2 P g (k 1 )P g (k 2 ) 


x (1 + /3Mi)(1 + dff) + eye.} P> 3 

I32 


Pfki 


(1 + /?M 2 ' 2 


2 / 0 7 . 2 ,, 2 i + C y C - 


(1 + ai/2klf{) 


1 (-\ 2\ -^31 

7 -h (1 — a ) —, 

J 33 I 33 


where P 3 denotes the damping term due to the incoherent small 
scale velocity dispersion (see eq. 6 ): 


D 3 = 


x 




^ | ofklal^ 
-1/2 


(33) 
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